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(N ' 

, We estimate the resource requirements, the total number of physical qubits and computational 

time, required to compute the ground state energy of a 1 -D quantum Transverse Ising Model (TIM) 



(N 
> 



of N spin- 1/2 particles, as a function of the system size and the numerical precision. This estimate 
is based on analyzing the impact of fault-tolerant quantum error correction in the context of the 



(— I , Quantum Logic Array (QLA) architecture. Our results show that due to the exponential scaling of 

the computational time with the desired precision of the energy, significant amount of error correciton 
^ ' is required to implement the TIM problem. Comparison of our results to the resource requirements 

^ ' for a fault-tolerant implementation of Shor's quantum factoring algorithm reveals that the required 



logical qubit reliability is similar for both the TIM problem and the factoring problem. 



I. INTRODUCTION 



00 ' The calculation of the basic properties of quantum systems (eigenstates and eigenvalues) remains a chal- 

o : 

lenging problem for computational science. One of the most significant issues is the exponential scaling of 
^ , the computational resource requirements with the number of particles and degrees of freedom, which for 

■ even a small number of particles (~ 100) exceeds the capabilities of current computer systems. In 1982 

Feynman addressed this problem b y p roposing that it may be possible to use one quantum system as the 
basis for the simulation of another Uj]. This was the early promise of quantum simulation, and one of the 
original motivations for quantum computing. Since that time, many researchers have investigated different 



approaches to quantum simulation 



^|7|]. For example, Abrams and Lloyd have proposed a quan- 



tum algorithm for the efficient computation of eigenvalues and eigenvectors using a quantum computer [41]. 
Many of the investigations into quantum simulation have assumed ideal performance from the underlying 
components resulting in optimistic estimates for the quantum computer resource requirements (number of 
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qubits and time to completion). It is well known, however, that in order to address the effects of decoherence 
and other sources of faults and errors in the implementation of qubits and gates it is necessary to incorporate 
fault-tolerant quantum error correction into an estimate of the resource requirements. 

In this paper we estimate the resource requirements for a quantum simulation of the ground state energy 
for the 1-D quantum Transverse Ising Model, specifically incorporating the impact of fault-tolerant quantum 
error correction. We apply the general approach of Abrams and Lloyd |l3L |4|], and compute estimates for 
the total number of physical qubits and computational time as a function of the number of particles (N) and 
required numerical precision (M) in the estimate of the ground state energy. 

We have chosen to study the resource requirements for computing the ground state energy for the 1-D 
quantum TIM since this model is well studied in the literature and has an analytical solution isl ^ fiol . 
The relevant details of the TIM are summarized in Section Ull In Section Hill we map the calculation of 
the TIM ground state energy onto a quantum phase estimation circuit that includes the effects of fault- 
tolerant quantum error correction. The required unitary transformations are decomposed into one qubit 
gates and two-qubit controlled-not gates using gate identities and the Trotter formula. The one-qubit gates 
are approximated by a set of gates which can be executed fault-tolerantly using the Solovay-Kitaev theorem 
Ini . In Section ITlI CI the quantum circuit is mapped onto the Quantum Logic Array (QLA) architecture 
model, previously described by Metodi, et al. il2l] . Our final results, utilizing the QLA architecture, are 
given in Section IIII Dl and a discussion of how improving the state of the art in the underlying technology 
affects the performance for executing the TIM problem. In Section |IVl we extend our resource estimate 
from 1-D to higher dimensions. Since the QLA architecture was developed to study the fault-tolerant 
resource requirements for Shor's quantum factoring algorithm lisll . we compare our present results for the 
TIM quantum simulation with previous analysis of the the resource requirements for Shor's algorithm, in 
Section V. Finally, our conculsions are presented in Section |Vll 



II. TRANSVERSE ISING MODEL 



The 1-D Transverse Ising_^Model is one of the simplest models exhibiting a quantum phase transition 



at zero temperature 



ML 



15n . The calculation of the ground state energy of the TIM varies from 
analytically solvable in the linear case |8] to computationally inneficient for frustrated 2-D lattices I l3l . 
For example, the calculation of the magnetic behavior of frustrated Ising antiferromagnets requires com- 



putationally intensive Monte-Carlo simulations 



Given the difficulty of the generic problem and the 
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centrality of the TIM to studies of quantum phase transitions and quantum annealing, the TIM is a good 
benchmark model for quantum computation studies. 

The 1-D Transverse Ising Model consists of A^-spin-1/2 particles at each site of a one dimmensional 
linear lattice (with the spin axis along the z-axis) in an external magnetic field along to x-axis. The Hamil- 
tonian for this system, Hf, may be written as 

where J is the spin-spin interaction energy, T is the coupling constant and related to the strength of the 
external magnetic field along the x-direction, and implies a sum only over nearest-neighbors, af and 
af are the Pauli spin operators for the ith spin, and we set ?i = 1 throughout this paper. 

In present work we focus is on the 1-D linear chain TIM of N-spins with constant Ising interaction 
energy Jij = — J. The ground state of the system is determined by the ratio of g = T/J. For the large 
magnetic field case, g » 1 the system is paramagnetic with all the spins aligned along the x axis, and 
in the limit of small magnetic field, 17 << 1, the system has two degenerate ferromagnetic ground states, 
parallel and anti-parallel to the z axis. In the intermediate range of magnetic field strength the linear 1-D 
TIM exhibits a quantum phase transition at 5 = 1 ||9|]. 

The TIM Hamiltonian in Equation [H for the 1-D case with constant couphng can be rewritten as: 

(N N-1 \ 

Y,9Xj + Y.Z,Z,+i\ (2) 
i=i i=i / 

where the PauU spin operators are replaced with their corresponding matrix operators Xj, Zj. For the 1-D 
TIM, the ground state energy can be calculated analytically in the limit of large N[8]. In the case of a 
finite number of spins with non-uniform spin-spin interactions (J not constant), it is possible to efficiently 



simulate the TIM using either the Monte-Carlo method III8II or the density matrix renormalization group 
approach [10]. The challenge for classical computers comes from the 2-D TIM on a frustrated lattice where 
the simulation scales exponentially with N. Applying the quantum phase estimation circuit to calculate the 
ground state energy of the TIM requires physical qubit resources, which scale polynomially with N, and 
the number of computational time steps is also polynomial in N. In addition, just as the complexity of the 
problem is independent of the lattice dimension and layout when applying classical brute force diagonaliza- 
tion, the amount of resources required to apply the quantum phase estimation circuit is largely independent 
of the dimensionality of the TIM Hamiltonian. 
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m. TIM QUANTUM SIMULATION RESOURCE ESTIMATES 

Our approach to estimating the resource requirements for the TIM ground-state energy calculation with 
Hamiltonian Hi involves two steps. First, we follow the approach of Abrams and Lloyd and map the 
problem of computing the eigenvalues of the TIM Hamiltonian in Equation |2] onto a phase estimation 
quantum circuit ij, |4|]. Second, we decompose each operation in the phase estimation circuit into a set of 
universal gates that can be implemented fault-tolerantly within the context of the QLA architecture. This 
allows us an accurate estimate of the resources in a fault-tolerant environment. 



A. Phase estimation circuit 

The phase estimation algorithm allows one to calculate an M-bit estimate of the phase cj) of the eigen- 
value e"^^^*^ of the time evolution unitary operator U (r) = e~^^^^ , where the time r is constant through- 
out the implementation of the phase estimation algorithm. The desired energy eigenvalue E of Hj can be 
computed using (j) by calculating E = 

The value of r is determined by the fact that the output from the phase estimation algorithm is the binary 
fraction O.xi ... xm, which is less than one isLlj]. In order to ensure that this result is a valid approximation 
of the phase 0, we must set the parameter r such that r < 2it/ E, which corresponds to i;^ < 1. For the 1-D 
TIM, the magnitude of the ground-state energy \Eg \ is bounded by NJ{1 + g) |8]. In the region near the 
phase transition 5 « 1, we choose t={10JN)^^ , which satisfies r < 2TT/\Eg\. 

The quantum circuit for implementing the phase estimation algorithm is shown in Figure [T] The circuit 
consists of two quantum registers: an A^-qubit input quantum register prepared in an initial quantum state 
l^*), and an output quantum register consisting of a single qubit recycled M times |Q,Hl- Each of the iV 



qubits in the input register corresponds to one of the N spin- 1/2 particles in the TIM model 12111 . At the 
beggining of each of the M steps in the algorithm, the output qubit is prepared into the state -^(lO) + |1)) 
using a Hadamard (H) gate. The H gate is followed by a controlled power of U{t), denoted with U (2'™t), 
applied on the input register, where < m < M — 1. 

Letting j denote to the jth step in the circuit, each time the output qubit is measured (meter symbols) 
the result is in the mth bit in the estimate of (p, following the rotation of the output qubit via the gate: 

i?, = 10)(0|+exp ivr ^ |l)(l| (3) 

\ m=M+2-j J 
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FIG. 1 : The circuit for implementing the phase estimation algorithm using one continuously recycled control qubit. 

where the gate R j corresponds to the application of the Quantum Fourier Transform on the output qubit at 
each step 11191. boll. The result after each of the M measurements is an M-bit binary string {xiX2 ■ ■ ■ xm}, 
which corresponds the M-bit approximation of given by O.xi ... xm- Using this estimate of (p, the 
corresponding energy eigenvalue E = will be the ground-state energy Eg with probability equal to 

n ^ 

[("Jfl^g)!^ Ol], where \"^g) is the ground eigenstate of Hj. 

To maximize the probability of success the initial quantum state |^) should be an approx- 

imation of the ground state j^g). For arbitrary Hamiltonians the preparation of an approximation to \"^g) 



is generally computationally difficult i22l 



2311 . For certain cases, the preparation can be accomplished us- 



ing classical approximation techniques to calculate an estimated wavefunction or adiabatic quantum state 



preparation techniques 116, 



2lh . If the state can be prepared adiabatically, the resource requirements for 



preparing j^) are comparable in complexity to the resource requirements for implementing the circuit for 
the phase estimation algorithm shown in Figure[T][21]. For this reason, we focus our analysis on estimating 
the number of computational time steps and qubits required to implement the circuit, assuming that the 
input register has been already prepared into the iV-qubit quantum state 



B. Decomposition of the TIM quantum circuit into fault-tolerant gates 

Figure [T] in Section ITlI Al shows the TIM circuit at a high-level, involving + 1 unitary operators. In 
this section, each unitary operation of the circuit is decomposed into a set of basic one and two qubit gates 
which can be implemented fault-tolerantly using the QLA architecture. The set of basic gates used is 

{X,Z,iJ,r, 5, CNOT, Measure} (4) 

where MEASURE is a single qubit measurement in the z basis, CNOT denotes the two-qubit controUed- 
NOT gate, and T and S gates are single-qubit rotations around the z-axis by 7r/4 and it/ 2 radians respec- 
tively. The high-level circuit operations which require decomposition are the controlled- C/( 2™ r) gates and 
each Rj gate. 
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FIG. 2: Circuit for the controlled unitary operation U{2"^t) approximated using the Trotter formula. 



The Controlled-?7(2'"r) gate can be decomposed using the second-order Trotter formula 



2M- First, 



Hi is broken into two terms: Hx = "^jLoQ^j^ representing the transverse magnetic field, and Hzz 
J2f=o^ ^j^j+i^ representing the Ising interactions. By considering the related unitary operators 

N 

U^{2t) = Ylexpi-igrXj) 

i=i 

N-l 

UzzC^t) = Y[ exp(-irZjZj+i), 



(5) 
(6) 



where we set g = I, as discussed in Section |lll We can construct the Totter approximation of U{2"^t), 
denoted by [7(2™r) as: 



= c7(2-r) + eT, 

(2™t/A;) and is the Trotter approximation error, which scales as O ( 



(7) 



\M- The 



where 6 

Trotter approximation error can be made arbitrarily small by increasing the integer Trotter parameter k. 
Since the controUed-C/ (2'"r) corresponds to the {M — m)th bit, must be less than 1/2^^""*, which is the 
precision of the (M — m)th measured bit in the binary fraction for the phase 0. Thus, when approximating 
U (2'"r), k is increased until er is less than 1/2^^-'". For a given M, we estimate a numerical value for the 
Trotter parameter A;(m = 0) = A;o as a function of < 10, with the constraint that < 1/2*^. We thus 
find that for fixed M, ko scales as 1 /N. We use the exponent based on < 10 to extrapolate ko for larger 
N. For m > 0, we set k = 2"^ko, which will satisfy the error bound based on the scaUng of with k. 
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FIG. 3: The decomposition of the controlled unitary operation Ux{0) into single-qubit gates and CNOT gates. 
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FIG. 4: The decomposition of the controlled unitary operation UzzC^O) gate into single-qubit gates and CNOT 
gates. 

The circuit corresponding to the Trotter approximation of U{2"^t) is shown in Figure|2j where it can be 
seen that the controIled-C/(2"^r) is composed of two controlled-C/^; (^) operations and a controIIed-C/^^^ (0) 
operation, repeated k times and controlled on the mth instance of the output qubit denoted with Qm- Ex- 
panding the circuit in Figure|2l we can express {7(2"^t) as: 



(8) 



which shows that, approximating U{2'^t) will require the sequential implementation of k controlled- 
gates, {k — 1) controlled-{72,(2^) gates, and two instances of controlled- (0) gates, all controlled 
on the mth instance of the output qubit. 

The quantum circuits for the decomposition of the controlled-C/a; (20) and contxo\\&A-Uzz{26) gates are 
shown in Figures |3] and |4l respectively. The gates are decomposed into rotations about the z-axis, Rz{0) = 
exp(— and CNOT gates. {N — 1) additional qubits are used to prepare an A^-qubit cat state in order 
to parallelize each of the N Rz{9) gates. The preparation of an A^-qubit cat state requires (A^ — 1) CNOT 
gates, which can be implemented in 0{N) time steps in parallel with the Rz{9 /A) gates in Figure[3]and in 
parallel with the Rz{9/2) gates in Figure ID 
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The three single-qubit gates (Rz{9), Rz{0/2), and RzjO/A)) can be approximated using 



0{log^-^'^{l/esk)) basics gates {H, T,S) with the Solovay-Kitaev theorem lU 



261 . The Solovay-Kitaev er- 



ror (esk) is equivalent to a small rotation applied to the qubit. Using the resuhs of Dawson and Nielsen 12611 
and 6 = to compute the sequence of H, T, and S gates required to approximate each of the three Rz 
gates. We define Sr as the length of the longest of these three sequences. For M=30, for example, we find 
that Sr = Ax 10^, requiring a sixth order [26] Solovay-Kitaev approximation. The results of this calculation 

show that the Solovay-Kitaev error e^fc < — , in order that the total error, ex is less than the required preci- 

k 

sion (1/2^-™), when we approximate C/(2"r). As a result Sr scales as 0{log^-^'^{k/€T)) = 0{M^-^''). 

We now have a complete decomposition of the controlled-C/(2™r) into the basic gates given in Equation 
m As a function of Sr, the number of time steps required to implement controlled- C/2;(0) and Uzz{G) is 
equal to {?>Sr + 4), and {QSr + 7), respectively. Following Equation^ the number of time steps required 
to implement the entire controlled- [/( 2"" r) is A;(95j? + 11) + 35r + 4, where k = 2"^ko. Each Rj gate in 
Figure dJis equivalent to at most a rotation by Rz{6) and requires less than Sr gates. 

Putting all of the above together, the total number of time steps {K) required to implement the TIM 
circuit as a function of Sr, kg, and M is given by: 

M-l 

K = [2™A:o(95k + 11) + 35^ + 4 + Sr] 

= 0(2^0 X Sr, (M ^ oo) (9) 

Since Sr scales as 0(M^-^^), the total number of time steps is dominated by the exponential dependance 
on the precision (M). The number of qubits Q required to implement the circuit is 2N, since qubits are 
needed for the input register j^*), one qubit is needed for the output register, and A^ — 1 qubits are needed 
for the cat state. 

In the next section we include fault-tolerant QEC into our circuit model and determine the resulting 
resource requirements, K and Q. We also provide an estimate on how long it could take to implement the 
TIM problem in real-time by taking into account the underlying physical implementation of each gate and 
qubit in the context of the QLA architecture. 



C. Mapping onto the QLA architecture 



28 



3H]i 



into the TIM circuit design 



Incorporating quantum error correction and fault-tolerance 11271 
will impact the resource requirements in two ways. First, each of the qubits becomes a logical qubit, that 
is encoded into a state using a number of lower-level qubits. Second, each gate becomes a logical gate, 
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realized via a circuit composed of lower-level gates applied on the lower-level qubits that make-up a logical 
qubit. Each lower-level qubit may itself be a logical qubit all the way down to the physical level. Thus, 
quantum error correction and fault-tolerance increases the number of physical time steps and qubits required 
to implement each basic gate and may even require additional logical qubits, depending on how each gate is 
implemented fault-tolerantly and the choice of error correcting code. The resource requirements necessary 
to implement encoded logical qubits and gates will depend on the performance parameters of the underlying 
physical technology, the type of error correcting code used, and the level of reliability required per logical 
operation. The physical technology performance parameters that are taken into account in the design of the 
QLA architecture are the physical gate implementation reliability, time to execute a physical gate, and the 
time it takes for the state of the physical qubits to decohere. 

The QLA architecture Ivll is a tile-based, homogeneous quantum computer architecture based on ion 



33h . Each tile represents a single 



trap technology, employing 2-D surface electrode trap structures Oil. 1321. 
computational unit capable of storing two logical qubits and executing fault-tolerantly any logical gate from 
the basic gate set given in Equation HI One of the key features of the QLA architecture is the teleportation- 
based logical interconnect which enables logical qubit exchange between any two computational tiles. The 
interconnect uses the entanglement-swapping protocol j34ll to enable logical qubit communication without 



adding any overhead to the number of time required to implement a quantum circuit 



The QLA was originally designed based on the requirement to factor 1024-bit integers lll2h . This re 



quirement resulted in the need to employ the second order concatenated Steane |7, 1,3] quantum error 



correcting code i35h . Second order concatenation means that each logical qubit is a level 2 qubit, composed 



of 7 level 1 logical qubits each encoded into the state of 7 physical ion-trap qubits. 

To estimate the reliability for executing each of the basic-gates fault-tolerantly, a lower-bound of 3.1 x 



10^^ for the fault-tolerant threshold of the [7, 1, 3] code. This value was derived by Metodi, et al i36ll . 
by analysis of the ion-trap-based geometrical layout of each logical qubit tile. The |7, 1 , 3] code threshold 
value used in the current research differs from the previously published estimate of 1.8 x 10~^ IstI due 
to our more detailed account of the operations specific to the ion trap technology in the implementation 



of each logical qubit 113611 . Gottesman's methodology 113811 . which takes into account qubit movement, and 
these threshold results we estimate the reliability for each logical operation at levels 1 and 2. 

Since each qubit in the |7, 1, 3] code moves an average of 10 steps during error correction js^, we find 
that each level 1 gate has a failure probability of 3.2 x 10^^'^ and each level 2 gate has a failure probability 
of 3.5 X lO^^''. In our failure probability estimates, we have assumed optimistic physical ion trap gate 



error probabilities of 10 per physical operation, consistent with recent ion-trap literature 113911 . We also 
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FIG. 5: QLA architecture for the TIM problem 



determine the physical resources required for each logical qubit. Each level 1 qubit requires 21 ion-trap 
qubits (7 data qubits and 14 ancilla to facilitate error correction) and each level 2 qubit requires 21 level 1 
qubits. Given that the duration of each physical operation on an ion-trap device is currently on the order 



10 fis 140, 



4111 . the time required to complete a single error correction step is approximately 1.6 ms at level 



1 and 0.26 seconds at level 2. 

The number of logical qubits Q directly maps to the number of computational tiles required by the 
QLA, allowing us to estimate the size of the physical system. Similarly, the number of time steps K maps 
directly to the time required to implement the application since the duration of a single time step in the QLA 
architecture is defined as the time required to perform error correction, as discussed in Reference [12]. We 
define an aggregated metric KQ called the problem size equal to K x Q, which is an upper bound on the 



total number of logical gates executed during the computation 14211 . The inverse of the problem size, 1 / KQ, 



is the maximum failure probability allowed in the execution of a logical gate 14211 . which ensures that the 
algorithm completes execution at least 36% of the time. Taking into consideration the failure probabiUties 
per logical gate, the maximum problem size KQ which can be implemented in the QLA architecture is 
3.1 X 10^ at level 1 error correction, 3 x 10^^ at level 2, and 2.8 x 10^*^ at level 3. Level 3 error correction 
is not described in the design of the QLA architecture, however, its implementation is possible since a level 
3 qubit is simply a collection of level 2 qubits and the architecture design does not change. The estimated 
failure probability for each level 3 logical gate is 3.6 x 10"^^. 

The parameters K and Q for the TIM problem were estimated in Section IIIIBi where Q was found to 
be 2N and K is on the order of 0{2^^) x Sr. The fault-tolerant implementation of the T gate, however, 
requires an auxiliary logical qubit prepared into the state r| + ) for one time step followed by four time 
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FIG. 6: (color online) Numerical calculations for the number of logical cycles K (solid line) and days of computation 
necessary assuming = 100 spin TIM problem as a function of the desired maximum precision M < 20. 



steps composed of H, CNOT, S, and MEASURE gates |43], causing the value of K and Q to increase. 
Since many of the gates in the Solovay-Kitaev sequences approximating the Rz gates are T gates, when 
calculating K using Equation |9l the value of Sr must take into consideration the increased number of 
cycles for each T gate. All other basic gates are implemented transversally and require only one time step. 

The resulting functional layout for the QLA architecture for the TIM problem is shown in Figure [51 The 
architecture consists of 4N logical qubit tiles. The tiles labled with Qi through Q at are the data tiles which 
hold the logical qubits used in the A^-qubit input register \"^) and the "OUT" tile is for the output register. 
The tiles labled with Ci through Cj\j-i are the — 1 qubit tiles for the cat state. The T[ + ) tiles are for 
the preparation of the auxiliary states in the event that T gates are applied on any of the data qubits. All 
tiles are specifically arranged as shown in Figure [5] in order to minimize the communication required for 
each logical CNOT gate between the control and target qubits. For example, when preparing the cat state 
using all Ci tiles and the "OUT" tile, CNOT gates are required only between the "OUT" tile, Ci, and Cr- 
Similarly, Ci interacts via a CNOT gate only with C2, while C2 interacts only with Qs, during the cat state 
preparation. 




D. Resource estimates for the 1-D TIM problem 



The resource requirements for implementing the 1-D TIM problem using the QLA architecture are given 
in Figure [6l where we show a logarithmic plot of the number of time steps K (calculated using Equation 
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|9ll as a function of the energy percision M < 20, assuming N = 100. The figure clearly shows K's 
exponential dependence on M. The dependence of K on the number of spins (N) is negligible and appears 
only in the ko term in Equation |9] as 0{1/N), as discussed in Section ITlIBI In fact, since Q = AN, we 
expect very little increase in the value of the total problem size KQ as N increases. 

We see that for M < 8 no error correction is required. This is because the required reliability per 
gate of l/KQ is still below the physical ion-trap gate reliability of 1 x 10^^. Without error corection, 
the architecture is composed entirely of physical qubits and all gates are physical gates. This means that 
each single-qubit Rz gate can be implemented directly without the need to approximate it using the Solvay- 
Kitaev theorem, resulting in 5/? = 1 in Equation |9l and the total number of qubits becomes 2N instead 
of 4A^. For M > 9 error correction is required, resulting in a sudden jump in the number of timesteps 
at M = 9, with an additional scaling factor of 0{M^) in K due to 5/j's dependence on M. In fact, K 
increases so quickly that at M = 9 that level 2 error correction is required instead of level 1. At M > 18 
level 3 error correction is required and while there is no increase in K, each time step is much longer, so 



there is a jump in the number of days of computation. The Solovay-Kitaev order i2ql for M = 9 is three 
and increases to order five for M = 20. 



E. Discussion of the resource estimates 

Our resource estimates for the 1-D TIM problem indicate that multiple levels of error correction, even for 
modest precision requirements, results in long computational times. As shown in Figure [6l it takes longer 
than 100 days, even for M = 7, when level 2 error correction is required. When level 3 error correction is 
required the estimated time is greater than 7.5 x 10^ years. 

The number of logical cycles K, which grows exponentially with M, contributes to the long computa- 
tional times. However, the primary factor contributing to the long computational time is the time it takes to 
implement a single logical gate using error correction. Presently, it is difficult to see how one might reduce 
the value of K short of implementing a different approach for solving quantum simulation problems. On 
the other hand, the logical gate time can be improved by implementing small changes in three parameters: 
decreasing the physical gate time tp, increasing the threshold failure probability pth, and decreasing the 
underlying physical failure probabihty po. 

The effect of these three parameters on the overall computational time for the 1-D TIM problem is 
shown in Figure |7j The figure shows how the total time, in days, for M = 18 varies as we improve each 
of the three parameters by a factor of 2 during each of the 10 iterations shown. The starting values for each 
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FIG. 7: (color online) The total computation time in days as we vary the physical cycle time tp (square markers), 
physical failure probability po (starred markers), threshold failure probability pth (diamond markers), and all together 
(circular markers) by a factor of two over 10 iterations. 

parameter in the figure are 3.1 x 10^^ for pth, 10^^ for po and 10 fis for tp. Decreasing tlie physical failure 
probabiUty and increasing the threshold values by a factor of 2 during each iteration causes the number of 
days to decrease quadratically whenever lower error correction level is required, otherwise the number of 
days remains constant from one iteration to the next. A single change in the error correction level from level 
3 to level 2 occurs by increasing pffi by a factor of 2 but there is no gain from additional increases in the 
threshold alone. Decreasing only po by a factor of 512 yields two changes in the error correction level. 

From this analysis, we see that in order to reach a computational time on the order of 100 days with only 
level 1 error correction, we need to achieve parameter values of pfh = 1 x 10^"^, po = 3x 10^^, and tp = 300 
ns, or better. This provides goals for the improvement in the device technologies necessary for quantum 
simulation. It should also be noted that these parameters are not completely independent and improvements 
in one of them may result in improvements in the others. For example, improving the physical failure 
probability may lead to better threshold failure probability by allowing some of the underlying operations 
to be weighted against one another. Similarly improving the threshold failure probability, may require 
choosing a more efficient quantum error correcting code which may have fundamentally shorter logical 
time. 
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IV. GENERALIZING TO fflGHER SPATIAL DIMENSIONS 

The 1-D TIM ground state energy can be efficiently computed using classical computing resources by 
taking advantage of the linear geometry of the spin configuration and significantly reducing the effective 
state space to a polynomial in A'^ jio!]. A 2-D TIM with ferromagnetic and antiferromagnetic Ising couplings 
can be difficult to solve due to spin frustration. Many reductions to this problem still yield an exponential 
number of states with near degenerate energy |l3 l. As a result, the problem size scales exponentially with 
the size of the lattice. In contrast, the implementation of the quantum phase estimation circuit in Figure [T]is 
largely independent of the geometry of the N spin states and the values of Fj and Jij, which suggests that 
it can be used for implementing efficiently higher-dimensional TIM problems. Consider, for example, the 



4411 using the phase estimation circuit. 



calculation of the ground state energy for the 2-D Villain's model 

Villain's model is a 2-D square lattice Ising model with A^^ spin sites in which the rows have all ferro- 
magnetic coupling and the columns alternate between ferromagnetic and antiferromagnetic. Each of the A^^ 
sites in Villain's model are represented by iV^ qubits in a x grid. The only change to the circuit for the 
phase estimation algorithm is the application of Uzz Ising interaction, which must be decomposed into two 
successive steps. First the rows of spin states are treated as the 1-D TIM problem in parallel, followed by 
the columns. Since the Uzz operations within each step are done in parallel, we still require N/ 2 additional 
qubits for the cat-states. Given that the remaining operations, including the Quantum Fourier Transform 
implementation, remain the same, the increase in the number of time steps to implement an A^^-spin 2-D 
TIM problem, compared to the 1-D TIM problem, is by less than a factor of two. Similarly, the increase in 
the resource requirements between a 1-D and a 3-D TIM problem will be by less than a factor of three. 



V. COMPARISON WITH FACTORING 



Since the QLA architecture was initially evaluated in the context of Shor's quantum factoring algorithm 
lisi . it would be interesting to consider how the resource requirements for implementing the TIM problem 
compare to those for implementing the factoring algorithm. In this section, we compare the implementation 
of the two applications on the QLA architecture and highlight some important differences between each 
application. 

Even though both applications employ the phase estimation algorithm, there are several important differ- 
ences. First, the precision requirements are different. For Shor's quantum factoring algorithm, the precision 
M must scale linearly with the size A^ of the A^-bit number being factored lisi . where A^ > 1024 for mod- 
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FIG. 8: (color online) Performance characteristics of different QLA-based quantum computers in KQ space with fixed 
amount of physical resources. The binary precision for the Ising problem of M = {5, 10, 15, 20, 25} corresponds to 
decimal precision of {1, 3, 4, 6, 7} digits, respectively. 



em cryptosystems. For quantum simulations, the desired precision is independent of the system size N, and 
the required M is small compared to factoring. The second difference lies in the implementation cost of the 
repeated powers of the controlled-C/ (r) gates for each application. In Shor's algorithm, the gate is defined 
as U{t)\x) = \ax mod N). Higher order powers of the unitary can be generated efficiently via modular 
exponentiation lll3ll . The result is that the implementation of U{2"^t) requires 2m times the number of 
gates used for U{t). For generic quantum simulation problems, the implementation cost of U (2™r) equals 
2"^ times U{t), because of the Trotter parameter k. The implementation of the control unitary gates for 
quantum simulation is not as efficient as that for the modular exponentiation unitary gates. The third dif- 
ference lies in the preparation of the initial iV-qubit state l^*). The preparation of |^) for the TIM problem 
by adiabatic evolution is comparable in resource requirements to the phase estimation circuit. For Shor's 
quantum factoring algorithm |^) = |1) in the computational basis and is easily prepared. 

Finally, factoring integers large enough to be relevant for modern cryptanalysis requires several orders 
of magnitude more logical qubits than the scale of quantum simulation problems considered in this paper. 
At minimum, the factoring of an iV-bit number requires 2N + 3 qubits, using the same one-control qubit 
circuit given in Figured] [451]. As shown later in this section, however, choosing to use only the minimum 
number of qubits required for factoring leads to very high error correction overhead. A more reasonable 
implementation of the factoring algorithm requires 0{N'^) number of logical qubits, which corresponds to 
millions of logical qubits for factoring a 1024-bit number. Quantum simulation problems require signif- 
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icantly less computational space and the problems considered in this paper require less than 500 logical 
qubits. 

We examine how these differences affect the relative size of the QLA architecture required to implement 
each application. In particular, Figure [8] shows the performance of QLA-based quantum computers in KQ 
space with fixed physical resources. Each horizontal line corresponds to the KQ limit for a QLA-based 
architecture modeled for factoring a 1024-bit number (top-most horizontal dashed line), a 512-bit number, 
a 128-bit number, and an 8-bit number, respectively. The physical resources for each QLA-N quantum 
computer (where = {1024, 512, 128, 8} bits) are determined by how many logical qubits at level 2 
error correction are required to implement the Quantum Carry Look-ahead Adder (QCLA) factoring circuit 
112, l^]; which requires 0{N'^) logical qubits and 0{N log^N) logical cycles. The plateaus in each 
QLA-A^ line of Figure [8] represent using all of the qubits at a specific level of encoding, with the top-most 
right-hand plateau representing level 1. Where the lines are sloped, the model is that only a certain number 
of the lower level encoded qubits can be used. Once this reaches the number of qubits that can be encoded 
at the next level, the quantum computer is switched from encoding level L to L + 1 by using all the available 
level L qubits. 

Figure [8] shows that a QLA- quantum computer is capable of executing an application using level L 
encoded qubits if the application instance is mapped underneath the line representing the computer at level 
L. Factoring a 1024-bit number, for example, falls directly on the level 2 portion of the QLA- 1024 line 
(see the square markers). Anything above that line cannot be implemented with the QLA-1024 computer. 
Similarly, factoring a 128-bit number maps under the QLA-128 line, but can be accomplished using level 
1 qubits. The TIM problem is mapped onto Figure [8] for N = 50, 100, 150 and several binary precision 
instances: M = {5, 10, 15, 20, 25}. As expected, factoring requires many more logical qubits, however, 
both applications require similar levels of error correction. A decimal precision of up to 4 digits of accuracy 
(M = 15) can be reached by using a quantum computer capable of factoring an 8-bit number at level 2 
error correction, however higher precision quickly requires level 3 error correction. 

The resources for implementing quantum factoring with one-control-qubit were calculated following the 
circuit in Figure [T] where the unitary gates are replaced with the unitary gates corresponding to modular 



exponentiation, as discussed in Reference [45(1. The results are shown with the diamond-shaped markers in 
Figure [8] While this particular implementation is the least expensive factoring network in terms of logical 
qubits, the high precision requirement of M = 0{N) makes this network very expensive in terms of 
time steps. In fact, the number of time steps required pushes the reliability requirements into level 4 error 
correction for factoring even modestly-sized numbers. 
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VI. CONCLUSION 

In this paper, the TIM quantum simulation circuit was decomposed into fault-tolerant operations and we 
estimated the circuit's resource requirements and number of logical cycles K as a function of the desired 
precision M in the estimate of the ground state energy. Our resource estimates were based on the QLA 
architecture and underlying technology parameters of trapped ions allowing us to estimate both K, as a 
function of the level of the error correction level, and the total length of the computation in real-time. 

Our results indicate that even for small precision requirements K is large enough to require error cor- 
rection. The growth of K is due to its linear dependence on the the Trotter parameter k, which scales 
exponentially with the maximum desired precision M. In order for K to scale polynomially with the 
precision, new quantum simulation algorithms are required or systems must be chosen where the phase 
estimation algorithm can be implemented without the Trotter formula. The linear dependence of the num- 
ber of time steps on k is due to the fact that Ux and the Uzz do not commute. However, there are some 
physical systems, whose Hamiltonians are composed of commuting terms, such as the nontransversal clas- 
sical Ising model, which has a solution to the partition function in two dimensions but is NP-Complete for 
higher dimensions j^^. In those cases, Trotterization is unnecessary. In future work, we intend to general- 
ize the calculations of the resource requirements to other physical systems and consider different ways to 
implement the phase estimation algorithm that Umit its dependence on the Trotter formula. 
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